Expression vectors, transformed host cells and fermentation process for the production of
recombinant polypeptides

The present invention relates to an expression vector, comprising a polynucleotide encoding
a fusion protein comprising the signal sequence of the gac gene of Pseudomonas diminuta
and a polypeptide of interest, a prokaryotic host cell transformed with such an expression
vector and a process for production of a polypeptide of interest using said host cell and said
expression vector.

It is a subject of the present invention to provide a process for the efficient and direct 
production of a mature recombinant polypeptide in a prokaryotic host cell. The process 
according to the present invention can, for example, favourably be used for the production of 
recombinant human interferon alpha 2B (rhIFNα2B) in Escherichia coli (E. coli).

In the production of recombinant proteins in prokaryotic microorganisms, such as the
expression of human or other eukaryotic proteins in bacterial cells, it is often difficult to obtain
a clearly defined N-terminus which is as nearly 100% homogeneous as possible. This
applies in particular to recombinant pharmaceutical proteins whose amino acid sequence
ought in many cases to be identical to the amino acid sequence naturally occurring in
humans/animals. Any inhomogeneity or deviation from the natural sequence is, however,
unacceptable in many cases because these products frequently show different
immunological (for example induction of antibody formation) and pharmacological (half-life,
pharmacokinetics) properties. For these reasons, it is necessary in most cases to produce a
nature-identical product (homogeneous and without foreign amino acids at the N-terminus).

On natural expression, for example in humans, many pharmaceutical proteins which are in
use are transported into the extracellular space, and cleavage of the signal sequence
present in the precursor protein for this purpose results in a clearly defined N-terminus. Such
a homogeneous N-terminus is not always easy to produce, for example in bacterial cells, for
several reasons.



The synthesis of all cytoplasmic proteins in prokaryotic microorganisms starts with a
methionine due to the start codon ATG, which is both a translation initiation site and the
codon for methionine. Depending on the structure of the second amino acid after the
N-terminal methionine, this methionine may be cleaved off by a host cell
methionine-aminopeptidase (MAP), leading to a mixture of a product starting either with Met
or with the second amino acid. The separation of these two species is very difficult and leads
to reduced yields. In recombinant cytoplasmic production of a polypeptide it is usual for a not
inconsiderable proportion (1-50%) of the polypeptide to remain unaffected by the MAP.
Therefore, production of recombinant polypeptides using a cytoplasmic Met-expression
system is often highly unfavorable.

Another possibility to produce a mature recombinant polypeptide via a cytoplasmic pathway
is the production of an N-terminal fusion protein with subsequent chemical or enzymatic in
vitro cleavage. However, in many cases the N-terminus of the fusion protein is not easily
accessible to enzymatic cleavage, leading to low cleavage rates or no cleavage at all. This
may be due to the N-terminus being structurally inaccessible.

In addition, recombinant proteins are expressed in the cytoplasm of prokaryotic
microorganisms in a reduced state, with the effect that disulphide bonds often necessary for
correct protein folding and function are not formed or not formed correctly. Recombinant
polypeptides containing disulphide bonds therefore may necessitate difficult in vitro
oxidation.

Also, cytoplasmic expression often leads to the formation of inclusion bodies, which have to
be solubilised under denaturing conditions followed by refolding to the native structure prior to
the actual purification process.

Periplasmic expression, on the other hand, could directly yield a product with the desired 
properties: 

(i) Correct, mature N-terminus by cleaving off of a periplasmic signal sequence by the host
cell's signal peptidase apparatus
(ii) Soluble expression due to correct folding
(iii) Correct disulphide bond formation due to an oxidative milieu in the periplasm very similar
to the one found in the human extracellular fluid (where this molecule is naturally found)

Accordingly, it is an object of the present invention to provide a periplasmic
expression system which is suitable for the production of a recombinant polypeptide of
interest in a prokaryotic host cell.

It has now surprisingly been found within the context of the present invention that an 
expression vector encoding the signal sequence of the glutaryl 7-ACA acylase gene (gac 
gene) of Pseudomonas diminuta and a polypeptide of interest is particularly suitable for use 
in a process for the recombinant production of the polypeptide. 

In one aspect, the present invention thus relates to an expression vector comprising a 
polynucleotide encoding a fusion protein comprising the signal sequence of the gac gene of 
Pseudomonas diminuta and a polypeptide of interest other than gac of Pseudomonas 
diminuta, wherein said signal sequence and said polypeptide of interest are linked in such a 
way that upon expression of the polynucleotide in a suitable host cell the signal sequence is
cleaved off the fusion protein and the polypeptide of interest is released into the periplasm of 
the host cell. 

In accordance with the present invention, a variety of polypeptides of interest can be
produced by utilization of the expression vector. For example, the polypeptide of interest can
be selected from the group consisting of an interferon, an interleukin, a growth hormone, a
growth factor, a cytokine, an enzyme, an enzyme inhibitor, an antibody and an antibody
fragment, and the like, for example interferon alpha 2A, interferon alpha 2B, interleukin-3,
interleukin-6, human growth hormone, insulin, granulocyte-colony stimulating factor,
granulocyte macrophage-colony stimulating factor, macrophage-colony stimulating factor,
interferon beta 1, bovine somatropin, porcine somatropin, interleukin-11, interleukin-2, a
Fab-fragment, and small peptides such as calcitonin, parathyroid hormone (PTH), or a
glucagon. Preferably, within the scope of the present invention, the polypeptide of interest is
a recombinant human interferon alpha 2, in particular human interferon alpha 2A or human
interferon alpha 2B, the latter being particularly preferred as the polypeptide of interest.

With respect to the polynucleotide encoding the part of the fusion protein which is the
polypeptide of interest, a cDNA or a synthetic polynucleotide can be used. If a given cDNA or
synthetic polynucleotide corresponding to a naturally occurring gene sequence is
expressed only poorly, the structure of the cDNA or synthetic polynucleotide may be adapted
to the respective host cell by codon optimization.

In this respect, within a preferred embodiment of the present invention, the polynucleotide
encoding the polypeptide of interest, being rhIFNα2B, will comprise the following nucleotide
sequence (SEQ ID NO 1)



5'-TGC GAT CTG CCG CAA ACC CAC AGC CTG GGT AGC CGG CGA ACC TTG
ATG CTT CTG GCA CAG ATG CGG CGA ATC TCT CTT TTC TCT TGC TTA
AAG GAT CGA CAT GAC TTC GGT TTC CCG CAG GAG GAG TTC GGT AAC
CAG TTC CAA AAG GCT GAA ACC ATC CCG GTA TTG CAT GAG ATG ATC
CAG CAG ATC TTC AAC CTG TTC AGC ACT AAG GAC TCT TCT GCT GCT
TGG GAT GAG ACC CTG CTT GAC AAA TTC TAC ACT GAA CTG TAC CAG
CAG CTG AAC GAC CTG GAA GCC TGC GTG ATC CAG GGT GTG GGT GTG
ACT GAG ACT CCG CTG ATG AAG GAG GAC TCT ATT CTG GCT GTG CGA
AAA TAC TTC CAA CGG ATC ACT CTG TAT CTG AAA GAG AAG AAA TAC
AGC CCG TGC GCC TGG GAG GTT GTC CGA GCA GAA ATC ATG CGG TCT
TTC TCT TTG TCT ACC AAC TTG CAA GAA TCT TTA CGA AGC AAG GAA-3',



where 48 out of 165 codons of the naturally occurring human interferon alpha 2B have been
altered with respect to the nucleotide sequence without changing the amino acid sequence.
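
The statement that the codon changes leave the amino acid sequence untouched can be
checked by translating SEQ ID NO 1 with the standard genetic code. The following is a
minimal sketch and not part of the original disclosure: the names CODON_TABLE and
translate are illustrative, and codon 93 is read as AAC (Asn), which is an assumption
consistent with the unchanged interferon alpha 2B amino acid sequence.

```python
from itertools import product

# Standard genetic code, built from the conventional T-C-A-G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# SEQ ID NO 1 as given in the text (codon 93 taken as AAC; see assumption above).
SEQ_ID_NO_1 = """
TGC GAT CTG CCG CAA ACC CAC AGC CTG GGT AGC CGG CGA ACC TTG
ATG CTT CTG GCA CAG ATG CGG CGA ATC TCT CTT TTC TCT TGC TTA
AAG GAT CGA CAT GAC TTC GGT TTC CCG CAG GAG GAG TTC GGT AAC
CAG TTC CAA AAG GCT GAA ACC ATC CCG GTA TTG CAT GAG ATG ATC
CAG CAG ATC TTC AAC CTG TTC AGC ACT AAG GAC TCT TCT GCT GCT
TGG GAT GAG ACC CTG CTT GAC AAA TTC TAC ACT GAA CTG TAC CAG
CAG CTG AAC GAC CTG GAA GCC TGC GTG ATC CAG GGT GTG GGT GTG
ACT GAG ACT CCG CTG ATG AAG GAG GAC TCT ATT CTG GCT GTG CGA
AAA TAC TTC CAA CGG ATC ACT CTG TAT CTG AAA GAG AAG AAA TAC
AGC CCG TGC GCC TGG GAG GTT GTC CGA GCA GAA ATC ATG CGG TCT
TTC TCT TTG TCT ACC AAC TTG CAA GAA TCT TTA CGA AGC AAG GAA
"""


def translate(spaced_codons: str) -> str:
    """Translate a whitespace-separated codon string into one-letter amino acids."""
    return "".join(CODON_TABLE[codon] for codon in spaced_codons.split())


protein = translate(SEQ_ID_NO_1)
print(len(protein), protein)
```

The translation starts with Cys rather than Met, which is what is expected for a mature
polypeptide whose signal sequence is removed by the host cell.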

With respect to the Pseudomonas diminuta strain, any Pseudomonas diminuta strain which
exhibits a glutaryl-7-aminocephalosporanic acid acylase activity will bear a gac gene encoding
a suitable signal sequence.

For example, such a Pseudomonas diminuta strain has been described in Czech Patent No.
CZ 278.515 under designation no. CCM 3987.
Such a Pseudomonas diminuta strain bears the gene encoding the enzyme
glutaryl-7-ACA-acylase, abbreviated gac.

A preferred embodiment of the present invention relates to the expression vector according
to the present invention, wherein said signal sequence of the gac gene of Pseudomonas
diminuta, forming part of the said fusion protein, comprises the amino acid sequence (SEQ
ID NO 2)

MLRVLHRAASALVMATVIGLAPAVAFA.

For example, such amino acid sequence may be encoded by the following polynucleotide 
sequence (SEQ ID NO 3) 

5'-ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG
ACT GTG ATC GGC CTT GCG CCC GCC GTC GCC TTT GCG-3'

Likewise, a polynucleotide sequence in which nucleotides have been altered so as to create a
restriction enzyme site can be used as long as such mutations are silent, i.e. they do not
change the amino acid sequence of the gac signal sequence as outlined above.

Accordingly, another example for a suitable polynucleotide sequence encoding the gac 
signal sequence may be (SEQ ID NO 4) 

5'-ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG
ACT GTG ATC GGC CTT GCG CCC GCG GTC GCC TTT GCG-3',

wherein, compared to the polynucleotide sequence mentioned above, a single mutation
(C->G) has been introduced with respect to the last nucleotide of codon 23, in order to
obtain a restriction enzyme site (Sac II site) which may be used in further cloning.
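
The silent mutation described above can be verified directly from the two sequences. The
sketch below is illustrative only (the helper and variable names are not from the original);
it translates SEQ ID NO 3 and SEQ ID NO 4 with the standard genetic code, locates the
single nucleotide difference, and looks for the Sac II recognition sequence CCGCGG.

```python
from itertools import product

# Standard genetic code, built from the conventional T-C-A-G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

SEQ_ID_NO_3 = (
    "ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG "
    "ACT GTG ATC GGC CTT GCG CCC GCC GTC GCC TTT GCG"
).replace(" ", "")
SEQ_ID_NO_4 = (
    "ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG "
    "ACT GTG ATC GGC CTT GCG CCC GCG GTC GCC TTT GCG"
).replace(" ", "")

SAC_II_SITE = "CCGCGG"  # recognition sequence of Sac II


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acids."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# 1-based positions at which the two sequences differ, with both nucleotides.
differences = [
    (i + 1, a, b)
    for i, (a, b) in enumerate(zip(SEQ_ID_NO_3, SEQ_ID_NO_4))
    if a != b
]
signal_3 = translate(SEQ_ID_NO_3)
signal_4 = translate(SEQ_ID_NO_4)

print(differences)                       # position, original base, mutated base
print(signal_3 == signal_4)              # mutation is silent if True
print(SAC_II_SITE in SEQ_ID_NO_3, SAC_II_SITE in SEQ_ID_NO_4)
```

Nucleotide 69 is the third position of codon 23, so the comparison reproduces the
description given in the text.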

Another preferred embodiment of the present invention relates to the expression vector
according to the present invention, wherein said vector is a plasmid. A low (about 1-10
copies per cell), medium (about 10-50 copies per cell) or high (more than about 50 copies per
cell) copy plasmid may be used. These definitions apply if such a plasmid is used in a
favourable environment, and such copy numbers may be lower under certain fermentation
conditions, e.g. under reduced temperature conditions. For strong inducible systems a
medium copy basic replicon (e.g. pBR322) may be adequate; for strong basal expression
from low or medium strength expression elements (promoter, RBS, structural gene) a high
copy replicon (e.g. plasmids from the pUC series) may be better suited. In the context of the
present invention, in particular with respect to production of rhIFNα2B, a high-copy plasmid
is preferably used. Accordingly, a further preferred embodiment of the present invention
relates to an expression vector according to the present invention, wherein said vector is a
high copy plasmid.

Further elements of the expression vector according to the present invention comprise
transcription and translation elements, in particular a promoter and a ribosomal binding site
(RBS).

For periplasmic expression, the ribosomal binding site should be neither too strong nor too
weak. The former could lead to an overstrain of the periplasmic protein export apparatus
(translocase etc.) and the deposition of (often insoluble) uncleaved fusion protein in the
cytoplasm. The latter could lead to poor protein yields. The situation on the transcriptional
level (promoter) is similar to the one on the translational level (RBS). Transcription should
be neither too strong nor too weak in order to avoid the described phenomena. Also, the
sudden onset of protein production when using an inducible promoter may cause problems
with "clogging" the export apparatus or will at least require extensive fine-tuning of the
induction step and fermentation parameters. This will often lead to non-robust processes.

It has been found that in the context of the present invention the promoter region and the
ribosomal binding site of the gac gene of Pseudomonas diminuta are particularly suitable for
serving as transcriptional and translational regulatory elements.

Therefore, a preferred embodiment comprises an expression vector according to the present
invention, wherein said vector further comprises a polynucleotide comprising the promoter
region and the ribosomal binding site of the gac gene of Pseudomonas diminuta, which
polynucleotide is operatively linked to the polynucleotide encoding the fusion protein
comprising the signal sequence and the polypeptide of interest.

In a further preferred embodiment thereof said polynucleotide comprising the promoter
region and the ribosomal binding site comprises the nucleotide sequence (SEQ ID NO 5)

5'-ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG
CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC-3'

In a yet further preferred embodiment thereof said polynucleotide comprising the promoter
region and the ribosomal binding site comprises the nucleotide sequence (SEQ ID NO 6)

5'-TCTAGACCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGGCCGAC
CTGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCTGGTTCGT
ACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCAACGGCGTCT
CCGGATCTGGGTGAGAGGGGAAATCC-3'.
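
The relation between the two regulatory sequences can be made explicit by a simple string
comparison. As a hedged illustration (the variable names are not from the original, and the
observation is derived only from the sequences as printed here), the sketch below checks
whether SEQ ID NO 6 ends with the complete SEQ ID NO 5, i.e. whether the longer
sequence extends the shorter one in the 5' direction.

```python
# Sequences copied line by line from the text (5' to 3').
SEQ_ID_NO_5 = (
    "ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG"
    "CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC"
)
SEQ_ID_NO_6 = (
    "TCTAGACCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGGCCGAC"
    "CTGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCTGGTTCGT"
    "ACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCAACGGCGTCT"
    "CCGGATCTGGGTGAGAGGGGAAATCC"
)

# True if the shorter sequence is the 3' end of the longer one.
is_three_prime_end = SEQ_ID_NO_6.endswith(SEQ_ID_NO_5)

# Additional upstream nucleotides present only in SEQ ID NO 6.
upstream_extension = SEQ_ID_NO_6[: len(SEQ_ID_NO_6) - len(SEQ_ID_NO_5)]

print(is_three_prime_end, len(SEQ_ID_NO_5), len(upstream_extension))
```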

Further elements may be present in an expression vector according to the present invention, 
as appropriate. 

For example, an expression vector according to the present invention may comprise a
polynucleotide comprising one or more transcription terminator(s).

Likewise, an expression vector according to the present invention may comprise a
polynucleotide encoding one or more selectable markers, e.g. to provide for antibiotic
resistance of a transformed host cell. Suitable selectable markers are widely known in the
art. In the context of the present invention, the expression vector favourably comprises a
polynucleotide comprising a tetracycline resistance gene.

Additionally, where suitable, further regulatory elements may be present on an expression
vector according to the present invention. Regulatory elements, like a repressor or an
enhancer, are widely known in the art.

In a further aspect, the present invention relates to prokaryotic host cells which are
transformed with an expression vector according to the present invention in order to be
capable of bringing about expression of the polypeptide of interest.



WO 2005/017162 PCT/EP2004/009067

Therefore, the present invention relates to a prokaryotic host cell transformed with an
expression vector which is compatible with the host cell, said vector comprising a
polynucleotide encoding a fusion protein comprising the signal sequence of the gac gene of
Pseudomonas diminuta and a polypeptide of interest other than gac of Pseudomonas
diminuta, wherein said signal sequence and said polypeptide of interest are linked in such a
way that upon expression of the polynucleotide in a suitable host cell the signal sequence is
cleaved off the fusion protein and the polypeptide of interest is released into the periplasm of
the host cell.

Examples for suitable polypeptides of interest are those as mentioned above. A preferred
embodiment thereof relates to such a prokaryotic host cell, wherein the polypeptide of
interest is an interferon alpha 2. In particular, such interferon alpha 2 is selected from the
group consisting of interferon alpha 2A and interferon alpha 2B, the latter being preferred.

Another preferred embodiment thereof relates to a host cell according to the present 
invention, wherein said vector is a plasmid, preferably a high copy plasmid. 

The present invention further relates to a host cell according to the present invention,
wherein said signal sequence of the gac gene of Pseudomonas diminuta comprises the
amino acid sequence (SEQ ID NO 2)

MLRVLHRAASALVMATVIGLAPAVAFA.

In a further aspect, the present invention relates to the host cell according to the present
invention, wherein said vector further comprises a polynucleotide comprising the promoter
region and the ribosomal binding site of the gac gene of Pseudomonas diminuta, which
polynucleotide is operatively linked to the polynucleotide encoding the fusion protein
comprising the signal sequence and the polypeptide of interest.

Such polynucleotide comprising the promoter region and the ribosomal binding site 
preferably comprises the nucleotide sequence (SEQ ID NO 5) 

5'-ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG
CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC-3'.

In a further preferred embodiment thereof, such polynucleotide comprising the promoter
region and the ribosomal binding site comprises the nucleotide sequence (SEQ ID NO 6)

5'-TCTAGACCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGGCCGAC
CTGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCTGGTTCGT
ACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCAACGGCGTCT
CCGGATCTGGGTGAGAGGGGAAATCC-3'.

With respect to prokaryotic host cells, preferably said host cell according to the present
invention is a Gram-negative bacterial cell. Preferably such a bacterial cell is selected from
the group consisting of Escherichia coli (E. coli), Pseudomonas sp., Enterobacter sp.,
Campylobacter sp., and Vitreoscilla sp., E. coli being particularly preferred. Preferably,
derivatives of E. coli K-12 will be used because such strains have a long history of safe use
and are particularly suitable for periplasmatic expression. Likewise, other types of E. coli,
e.g. E. coli B derivatives, may be used.

In a preferred embodiment of the present invention, the prokaryotic host cell is derived from
E. coli W3110 (ATCC 27325). Such a strain has been deposited with the American Type
Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 20110-2209, USA on
February 28, 2001, under the Designation No. PTA-3132.

E. coli W3110 (ATCC 27325) and the deposited strain ATCC PTA-3132 can genetically be
characterized as follows:

Escherichia coli K-12 [F- mcrA mcrB IN(rrnD-rrnE)1 lambda-].

A further aspect of the present invention relates to a process for production of a polypeptide 
of interest, comprising 

(i) providing a prokaryotic host cell transformed with an expression vector which is
compatible with the host cell, said vector comprising a polynucleotide encoding a fusion
protein comprising the signal sequence of the gac gene of Pseudomonas diminuta and a
polypeptide of interest other than gac of Pseudomonas diminuta, wherein said signal
sequence and said polypeptide of interest are linked in such a way that upon expression of
the polynucleotide in a suitable host cell the signal sequence is cleaved off the fusion protein
and the polypeptide of interest is released into the periplasm of the host cell, and

(ii) culturing the prokaryotic host cell under conditions which cause expression of the
polynucleotide whereby upon formation of the fusion protein the signal sequence is cleaved
off the fusion protein and the polypeptide of interest is released into the periplasm of the host
cell.

Optionally, the process according to the present invention further comprises isolation of the
polypeptide of interest.

Examples for suitable polypeptides of interest are those as mentioned above. A preferred
embodiment thereof relates to such a prokaryotic host cell, wherein the polypeptide of
interest is an interferon alpha 2. In particular, such interferon alpha 2 is selected from the
group consisting of interferon alpha 2A and interferon alpha 2B, the latter being preferred.

Another preferred embodiment thereof relates to the process according to the present 
invention, wherein said vector is a plasmid, preferably a high copy plasmid. 

The present invention further relates to a process according to the present invention,
wherein said signal sequence of the gac gene of Pseudomonas diminuta comprises the
amino acid sequence (SEQ ID NO 2)

MLRVLHRAASALVMATVIGLAPAVAFA.

In a further aspect, the present invention relates to the process according to the present 
invention, wherein said vector further comprises a polynucleotide comprising the promoter 
region and the ribosomal binding site of the gac gene of Pseudomonas diminuta, which 
polynucleotide being operatively linked to the polynucleotide encoding the fusion protein 
comprising the signal sequence and the polypeptide of interest. 

Such polynucleotide comprising the promoter region and the ribosomal binding site 
preferably comprises the nucleotide sequence (SEQ ID NO 5) 

5'-ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG
CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC-3'.

In a further preferred embodiment thereof, such polynucleotide comprising the promoter
region and the ribosomal binding site comprises the nucleotide sequence (SEQ ID NO 6)

5'-TCTAGACCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGGCCGAC
CTGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCTGGTTCGT
ACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCAACGGCGTCT
CCGGATCTGGGTGAGAGGGGAAATCC-3'.

With respect to prokaryotic host cells, preferably said host cell according to the present 
invention is an E. coli cell; implications of that have been described in detail above.

In a further embodiment the present invention relates to the cultivation (or fermentation) part 
of the process utilizing the transformed host cell according to the present invention. 

Many factors can influence the productivity of fermentation processes employing
recombinant organisms. The applied fermentation strategy has to consider the sensitive
relationships between microbial physiology and plasmid copy number, plasmid stability and
gene expression, which is determined not only at the gene level but also by media composition
and process conditions. Besides the stability of the host strain, stability of the product is of
great importance for high level production of recombinant proteins.

For the development of a fermentation process employing a constitutive expression system,
growth conditions and recombinant product formation are closely related to each other.
Therefore the effort for optimal growth and control of growth conditions, which are in a close
relationship to product formation during the whole fermentation run, is higher compared to
induced expression systems.

Typical E. coli fermentation processes for the production of recombinant proteins are
characterized by short fermentation times in a range between a few hours and about 100
hours of cultivation. Most often not only a carbon source but also various complex or
inorganic nitrogen sources, different salts and trace elements are fed to E. coli cultures.
Carbon feeding usually follows time-profiles with a stepwise increase or exponential increase
of the feeding rates. Sometimes also mixtures of carbon sources are used during E. coli
fermentations. Feeding of the carbon source is usually coupled to the oxygen transfer
capacity of the bioreactor by the use of various control strategies.

A further object of the present invention is to provide a robust and reproducible fermentation
process, based on the gac expression system and corresponding transformed host cells
according to the present invention, for high yield periplasmatic expression of a polypeptide of
interest without the use of extensive nitrogen source feeding and optionally without the
addition of trace elements to the fermentation media.

The process according to the invention is carried out in principle by initially cultivating the
bacterial host cell, i.e. the expression strain, in accordance with microbiological practice
known per se. The strain is generally brought up starting from a single colony on a nutrient
medium, but it is also possible to employ cryopreserved cell suspensions (cell banks). The
strain is generally cultivated in a multistage process in order to obtain sufficient biomass for
further use.



On a small scale, this can take place in shaken flasks, wherein it is possible in most cases to
employ a complex medium (for example LB broth). However, it is also possible to use
defined media (for example citrate medium). For the cultivation, a small-volume preculture of
the host strain (inoculated with a single colony or with a cell suspension from a cryoculture)
is grown, the temperature for this cultivation not generally being critical for the later
expression result, so that it is possible routinely to operate at relatively high temperatures
(for example 30°C or 37°C). The main culture is set up in a larger volume (for example
500 ml), where it is in particular necessary to ensure good aeration (large volume of flask
compared with the volume of contents, high speed of rotation). Since it is intended that
expression takes place in soluble form, the main culture will in most cases also be carried
out at a somewhat lower temperature (for example 22°C or 28°C). Both inducible systems
(for example with trp, lac, tac or phoA promoter) and constitutive systems (like the preferred
system of the present invention comprising the gac promoter) are suitable for producing
soluble proteins. The resulting cells can be harvested and processed further.

On a larger scale, the multistage system consists of a plurality of bioreactors (fermenters),
preferably employing defined nutrient media in order to be able to improve the process
engineering control of the process or employing complex nutrient media in order to enhance
growth of the microorganism and to increase robustness of the process. In addition, it is
possible to greatly increase biomass and product formation by feeding of particular nutrients
(fed batch mode). For example, a preliminary stage fermenter and a main stage fermenter
are used. The preliminary stage fermenter is inoculated with a so-called inoculum which is
generally grown from a single colony or a cryoculture in a shake flask. Good aeration must
also be ensured in the fermenter - and especially in the main stage thereof. The resulting
cells are once again delivered for further processing.

Accordingly, in a preferred embodiment the culturing (or cultivation) of the process according
to the present invention as described herein is performed as a multi-stage process
comprising a pre-culture step and a main culture step. In the alternative, a single-step
fermentation without a pre-culture step is possible. In an even more preferred embodiment,
the process according to the present invention as described herein is performed as a
multi-stage fermentation process comprising a shake-flask step, optionally a pre-culture
step, and a main-culture step.

In particular, said culturing of the prokaryotic host cell in the main culture step is performed
in a culture medium comprising a substrate for more than about 90% of the cultivation time
at a substrate concentration lower than the saturation constant of the substrate,
accompanied by high levels of dissolved oxygen concentration, and further accompanied by
a steadily decreasing specific growth rate of the bacterial host cells, the process being
performed at a temperature which is lower than the optimum temperature for growth of the
host cell.

In this context, the saturation constant is the concentration of a substrate (in particular the
carbon source) at which the host cell is growing at a specific growth rate which is equivalent
to 50% of the maximum specific growth rate.

In this context, the specific growth rate is the increase of biomass concentration in a certain
time interval divided by the average biomass concentration of said time interval.
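
The two definitions can be illustrated numerically. The sketch below is an assumption-laden
illustration, not part of the original disclosure: it uses Monod-type kinetics, which the text
does not name but under which the saturation constant is exactly the substrate concentration
giving half-maximal growth; it expresses the growth rate per unit time; and it takes the
arithmetic mean of the two biomass readings as the "average biomass concentration". All
function and parameter names are hypothetical.

```python
def monod_growth_rate(substrate: float, mu_max: float, k_s: float) -> float:
    """Specific growth rate (1/h) under assumed Monod kinetics.

    substrate and k_s share the same concentration unit (e.g. g/l).
    """
    return mu_max * substrate / (k_s + substrate)


def specific_growth_rate(x_start: float, x_end: float, dt_hours: float) -> float:
    """Biomass increase per hour divided by the average biomass of the interval.

    The average is taken as the arithmetic mean of the two readings (assumption).
    """
    mean_biomass = (x_start + x_end) / 2.0
    return (x_end - x_start) / dt_hours / mean_biomass


# At a substrate concentration equal to the saturation constant,
# growth proceeds at half the maximum specific growth rate.
half_max = monod_growth_rate(substrate=0.5, mu_max=1.0, k_s=0.5)

# Example interval: biomass rises from 1.0 to 3.0 g/l within 2 hours.
mu_interval = specific_growth_rate(x_start=1.0, x_end=3.0, dt_hours=2.0)

print(half_max, mu_interval)
```

Keeping the substrate concentration below the saturation constant, as required for the main
culture, therefore keeps the culture below half of its maximum specific growth rate in this
model.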

Preferably, the culture medium of the main culture as well as, where applicable, of the
pre-culture, is a complex medium comprising a complex nitrogen source, preferably a yeast
extract, various salts and a carbon source to support initial growth of the host cell. In a
preferred embodiment thereof, said carbon source is either added to the main culture
medium by feeding of said carbon source after inoculation or, preferably, present in the main
culture at the time of inoculation.

Preferably, the concentration of dissolved oxygen in the main culture step is higher than
about 20%, more preferably from about 40% up to about 100% of saturation.

Preferably, the steadily decreasing growth rate in the main culture step is from about 2 h⁻¹ to
about 0.001 h⁻¹.

In a preferred embodiment, the temperature in the main culture step is between about 22°C
and about 35°C, preferably between about 25°C and about 31°C, most preferably about
28°C.

In another preferred embodiment, said cultivation in the pre-culture and/or main culture is
performed at a pH value in the range of about 6.7 to about 7.3.
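
The preferred operating windows for the main culture can be collected in a small check,
for example for screening logged fermentation data. This is an illustrative sketch only: the
class, field and function names are hypothetical, and only the outer ranges stated above
(dissolved oxygen above about 20% of saturation, about 22°C to about 35°C, pH about 6.7
to about 7.3) are encoded.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class MainCultureReading:
    """One logged measurement of the main culture (hypothetical structure)."""
    dissolved_oxygen_pct: float  # percent of saturation
    temperature_c: float         # degrees Celsius
    ph: float


def out_of_window(reading: MainCultureReading) -> List[str]:
    """Return the names of all parameters outside the preferred outer ranges."""
    problems: List[str] = []
    if reading.dissolved_oxygen_pct <= 20.0:
        problems.append("dissolved oxygen")
    if not 22.0 <= reading.temperature_c <= 35.0:
        problems.append("temperature")
    if not 6.7 <= reading.ph <= 7.3:
        problems.append("pH")
    return problems


ok_reading = MainCultureReading(dissolved_oxygen_pct=40.0, temperature_c=28.0, ph=7.0)
bad_reading = MainCultureReading(dissolved_oxygen_pct=10.0, temperature_c=37.0, ph=7.0)

print(out_of_window(ok_reading))
print(out_of_window(bad_reading))
```

Because the stated limits are all qualified by "about", the hard thresholds used here should
be read as a simplification.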

In a further preferred embodiment of the present invention, the substrate is glycerol or,
preferably, a carbohydrate. Preferably, the carbohydrate is glucose.

As mentioned herein, examples for suitable proteins of interest are those as mentioned
above. In a preferred embodiment, the polypeptide of interest is an interferon alpha 2. In
particular, such interferon alpha 2 is selected from the group consisting of interferon alpha
2A and interferon alpha 2B, the latter being preferred.

In a most preferred embodiment regarding all aspects of the processes of the present
invention, the host cell is an E. coli cell.

The polypeptide of interest can then be isolated by protein purification methods known to the
skilled person (see, for example, M.P. Deutscher, in: Methods in Enzymology: Guide to
Protein Purification, Academic Press Inc., (1990), 309-392). A purification sequence
generally comprises a cell disruption step, a clarification step (centrifugation or
microfiltration) and various chromatographic steps, filtrations and/or precipitations. A suitable
example for the isolation of a polypeptide of interest produced in accordance with the
present invention is given below.

Deposition of Microorganisms:

E. coli strain W3110 (ATCC 27325) has been deposited with the American Type Culture
Collection (ATCC), 10801 University Boulevard, Manassas, VA 20110-2209, USA on
February 28, 2001, under the Designation No. PTA-3132.

The following examples serve to illustrate the present invention, without in any way limiting
the scope thereof. Subject-matter disclosed in the examples relates to preferred
embodiments of the present invention.

Examples 

Example 1: Construction of a host cell strain for production of recombinant human interferon
alpha 2B (rhIFNα2B)

1.1 General considerations 

The polypeptide rhIFNα2b (recombinant human interferon-α2b) is produced in the 
Escherichia coli K-12 strain W3110 transformed with a plasmid containing an optimized 
synthetic gene coding for rhIFNα2b. rhIFNα2b is produced under the control of the promoter 
and Ribosome Binding Site (RBS) of the glutaryl 7-ACA acylase gene (gac) from 
Pseudomonas diminuta CCM 3987 by fermentation of recombinant E. coli K-12. rhIFNα2b is 
expressed as an N-terminal fusion protein with the signal sequence from the same (gac) 
gene, directing the protein to the periplasm with concurrent processing (cleaving off) of the 
signal sequence. The fermentation process therefore directly yields mature rhIFNα2b with a 
primary sequence identical to that of naturally occurring human interferon alpha 2b. The 
expression plasmid is designated pMG414, the production strain W3110[pMG414]. 

1.2 Construction of expression vector pMG414 

pUC19 serves as the starting point for the construction of the vector plasmid. pUC19 is a 
frequently used and thoroughly characterized high copy plasmid. It contains a highly efficient 
origin of replication and an ampicillin resistance (amp or bla) gene (Yanisch-Perron et al., 
1985; Vieira and Messing, 1982; GenBank accession numbers L09137 and X02514). 

Even though pUC19 is frequently used for the construction of expression plasmids, the amp 
gene may not be an ideal selectable marker for industrial purposes. For this reason the 
promoter and the coding region of the amp gene are removed and replaced by the promoter 
and the coding region of the tetracycline resistance gene (tet) from the well known safety 
plasmid pBR322 (Bolivar et al., 1977a, 1977b, 1978; review: Balbás et al., 1986; GenBank 
accession numbers J01749, K00005, L08654, M10283, M10286, M10356, M10784, 
M10785, M10786, M33694, V01119). This cloning work is performed with the help of high 
fidelity PCR techniques. 

To achieve this, the fragment spanning bps 1743 to 679 of pUC19 is amplified using high 
fidelity PCR (Pwo DNA Polymerase system from Roche Biochemicals) and the following 
5'-phosphorylated oligonucleotides: 

Oligo 235: 5'- Phosphate - TAACTGTCAG ACCAAGTTTA CTC -3' (SEQ ID NO 7) 
Oligo 236: 5'- Phosphate - GCGTTTCGGT GATGACGGTG -3' (SEQ ID NO 8) 

The resulting PCR fragment is 1624 bps in length and contains the complete pUC19 
backbone lacking the amp promoter and coding sequence, but including the stop codon and 
transcription terminator from the amp gene. 

As mentioned above, the tet promoter and coding sequence (excluding the stop codon) are 
amplified from pBR322. Again, high fidelity PCR is used to amplify bps 4 to 1273 of 
pBR322. The following 5'-phosphorylated oligonucleotides are used for this amplification: 

Oligo 237: 5'- Phosphate - TCATGTTTGA CAGCTTATCA TCG -3' (SEQ ID NO 9) 
Oligo 238: 5'- Phosphate - GGTCGAGGTG GCCCGGCTC -3' (SEQ ID NO 10) 

The resulting PCR fragment is 1270 bps in length. The two PCR fragments are purified by 
preparative agarose gel electrophoresis and ligated using T4 DNA Ligase (Rapid DNA 
Ligation Kit, Roche Biochemicals). The ligated DNA is purified and electroporated into 
Escherichia coli K-12 DH10B (Life Technologies ElectroMAX DH10B electrocompetent cells, 
genotype: F- mcrA Δ(mrr-hsdRMS-mcrBC) φ80dlacZΔM15 ΔlacX74 deoR recA1 endA1 
araD139 Δ(ara, leu)7697 galU galK rpsL nupG). Transformed cells are plated onto LB 
agar containing 15 mg/L tetracycline and 3 g/L glucose. Liquid cultures are grown in LB broth 
containing 15 mg/L tetracycline and 3 g/L glucose and plasmid DNA is isolated from these 
cultures using standard miniprep methods. Plasmid DNAs are analyzed by restriction 
analysis for correct integration of the tet fragment into the pUC19 backbone. Since 
integration of the fragment is unspecific with respect to orientation, only about 60% of all 
insert-containing clones have the fragment inserted in the correct orientation, i.e. the tet gene 
running in the same direction as the amp gene in pUC19. A larger amount of DNA is 
isolated from liquid cultures of a few clones and subjected to more detailed restriction 
analyses. Of those clones showing correct restriction patterns, one is selected for further 
cloning work. 

The respective plasmid is designated pMG402. It is identical to pUC19 in all features and 
functions but for the fact that it must be grown on/in tetracycline-containing media instead of 
ampicillin-containing media. This way a tet resistant high copy vector suitable for industrial 
purposes is generated. 

Features of plasmid pMG402: 

bps 1954-680: pUC19 backbone (= pUC19 lacking the amp promoter and structural gene) 
bps 681-1953: tet promoter and structural gene from pBR322 

rhIFNα2b is expressed as an N-terminal fusion with the signal sequence of glutaryl 7-ACA 
acylase from Pseudomonas diminuta CCM 3987 (gacss = SEQ ID NO 2) directing the 
protein to the periplasm with concurrent processing (cleaving off) of the signal sequence by 
the host cell's signal peptidase apparatus. 

Amino acid sequence of gacss (27 aa): MLRVLHRAAS ALVMATVIGL APAVAFA 

In the 3' region of the coding sequence of the gacss a Sac II restriction endonuclease site 
is introduced via the 3' PCR primer creating a silent mutation (amino acid sequence 
unchanged). This Sac II site allows fusion of the gacss coding region with the rhIFNα2b 
gene. 
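The relationship between the signal sequence, its coding region and the Sac II site can be checked with a short script. This is a minimal sketch: the coding sequence is copied from the expression cassette given later in this document (SEQ ID NO 18), the genetic code is the standard one, and the helper names are illustrative only.

```python
# Standard genetic code in the usual compact form (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon (incomplete trailing codon ignored)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# gacss coding region as it appears in the pMG414 expression cassette (81 nt).
GACSS_CDS = (
    "ATGCTGAGAGTTCTGCACCGGGCGGCGTCCGCCTTGGTTATGGCG"
    "ACTGTGATCGGCCTTGCGCCCGCGGTCGCCTTTGCG"
)
SAC_II = "CCGCGG"

peptide = translate(GACSS_CDS)
site_start = GACSS_CDS.find(SAC_II)            # 0-based position of the site
first_codon = site_start // 3 + 1              # 1-based codon numbers touched by the site
last_codon = (site_start + len(SAC_II) - 1) // 3 + 1

print(peptide)
print(f"Sac II site spans codons {first_codon} to {last_codon}")
```

The translation reproduces the 27 amino acid signal sequence, and the recognition site falls within codons 22 to 24, in line with the statement that the site is introduced without changing the amino acid sequence.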

The structural gene for rhIFNα2b is synthesized chemically. It differs from the natural human 
cDNA sequence in 48 of 165 codons and is designed to eliminate any weak and error-prone 
codons. 

In the following table, codon changes are indicated. In the table, "Natural codon" refers to 
the cDNA sequence published by Streuli et al., 1980 (GenBank Accession Number V00548). 
The amino acid numbers refer to mature hIFNα2b (starting with Cys 1). The amino acid 
sequence to be encoded by the synthetic gene is taken from the SwissProt Database, 
Accession number P01563/P01564 (amino acids 24 to 188). 



Exchange  Amino     Natural  Synthetic   Exchange  Amino     Natural  Synthetic
no.       acid      codon    codon       no.       acid      codon    codon
1         Cys 1     TGT      TGC         25        Leu 80    CTC      CTG
2         Pro 4     CCT      CCG         26        Leu 81    CTA      CTT
3         Arg 12    AGG      CGG         27        Leu 88    CTC      CTG
4         Arg 13    AGG      CGA         28        Asn 93    AAT      AAC
5         Leu 17    CTC      CTT         29        Cys 98    TGT      TGC
6         Arg 22    AGG      CGG         30        Ile 100   ATA      ATC
7         Arg 23    AGA      CGA         31        Gly 102   GGG      GGT
8         Ser 28    TCC      TCT         32        Gly 104   GGG      GGT
9         Leu 30    TTG      TTA         33        Thr 106   ACA      ACT
10        Asp 32    GAC      GAT         34        Pro 109   CCC      CCG
11        Arg 33    AGA      CGA         35        Ser 115   TCC      TCT
12        Phe 36    TTT      TTC         36        Arg 120   [...]    CGA
13        Gly 37    [...]    GGT         37        [...]     [...]    CGG
14        [...]     [...]    TTC         38        [...]     [...]    CTG
15        [...]     [...]    CCG         39        [...]     [...]    CCG
16        Phe 43    TTT      TTC         40        [...]     TGT      TGC
17        Gly 44    GGC      GGT         41        Arg 144   AGA      CGA
18        Pro 54    CCT      CCG         42        Arg 149   AGA      CGG
19        Val 55    GTC      GTA         43        Phe 151   TTT      TTC
20        Leu 56    [...]    TTG         44        Ser 154   TCA      TCT
21        [...]     [...]    AAC         45        Thr 155   ACA      ACC
22        Leu 66    CTC      CTG         46        Ser 160   AGT      TCT
23        Thr 69    ACA      ACT         47        Arg 162   AGA      CGA
24        Ser 72    TCA      TCT         48        Ser 163   AGT      AGC

The resulting gene allows efficient and precise transcription and translation of rhIFNα2b in 
Escherichia coli. Since the gene is designed for expression in a bacterial system it does not 
contain any untranslated sequences (introns etc.). 
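That the codon exchanges listed in the table leave the encoded protein unchanged can be confirmed against the standard genetic code. The following is a minimal sketch that checks a selection of the fully legible table rows; the function and variable names are illustrative and not part of the original disclosure.

```python
# Standard genetic code in the usual compact form (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
THREE_TO_ONE = {
    "Cys": "C", "Pro": "P", "Arg": "R", "Leu": "L", "Ser": "S", "Asp": "D",
    "Phe": "F", "Gly": "G", "Val": "V", "Thr": "T", "Asn": "N", "Ile": "I",
}

# (amino acid, position in mature protein, natural codon, synthetic codon),
# copied from rows of the exchange table that are complete in the source.
EXCHANGES = [
    ("Cys", 1, "TGT", "TGC"), ("Pro", 4, "CCT", "CCG"), ("Arg", 12, "AGG", "CGG"),
    ("Arg", 13, "AGG", "CGA"), ("Leu", 17, "CTC", "CTT"), ("Ser", 28, "TCC", "TCT"),
    ("Leu", 30, "TTG", "TTA"), ("Asp", 32, "GAC", "GAT"), ("Val", 55, "GTC", "GTA"),
    ("Asn", 93, "AAT", "AAC"), ("Ile", 100, "ATA", "ATC"), ("Gly", 102, "GGG", "GGT"),
    ("Thr", 155, "ACA", "ACC"), ("Ser", 160, "AGT", "TCT"), ("Ser", 163, "AGT", "AGC"),
]


def is_silent(amino_acid: str, natural: str, synthetic: str) -> bool:
    """True if both codons encode the stated amino acid."""
    expected = THREE_TO_ONE[amino_acid]
    return CODON_TABLE[natural] == expected and CODON_TABLE[synthetic] == expected


non_silent = [row for row in EXCHANGES if not is_silent(row[0], row[2], row[3])]
print("non-silent exchanges:", non_silent)
```

An empty result means that every checked exchange replaces a codon with a synonymous one, so only codon usage is altered.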

The structural gene is chemically synthesized. In brief, overlapping complementary 
oligonucleotides about 30 to 50 nucleotides in length are synthesized in a way to cover both 
strands of the structural gene sequence without any gaps. The oligonucleotides are 
hybridized to each other and ligated using T4 DNA Ligase. The reaction product is cut with 
restriction endonucleases and cloned into the pUC18 vector. The resulting plasmid is 
sequenced and shows the correct sequence. 

The synthetic gene on this plasmid does not contain the gac signal sequence. This part of 
the coding region is introduced via the gac fragment containing promoter, RBS and signal 
sequence and fused to the rhIFNα2b structural gene. 

The gac fragment is generated by chemical synthesis. For example, overlapping 
complementary oligonucleotides about 30 to 50 nucleotides in length are synthesized in a 
way to cover the full length of both strands of the gac fragment (including the restriction 
endonuclease recognition sites on both sides plus a minimum of 6 additional basepairs to 
allow efficient cleavage) without any gaps. The oligonucleotides are then hybridized to each 
other (e.g. by heating and subsequent cooling) and ligated using T4 DNA Ligase. The 
reaction product is then cut with the respective restriction endonucleases (Xba I and EcoR I) 
and cloned into the pMG402 vector (see below). 

In the alternative, the gac fragment containing promoter, RBS and signal sequence can be 
amplified from a plasmid comprising such elements, such as plasmid pKS55, the construction 
of which is described in CS patent No. 278,515. The gac gene cloned therein has been derived 
from a strain of Pseudomonas diminuta (CCM 3987). Amplification is carried out using a high 
fidelity PCR system. The restriction endonuclease sites needed for cloning are introduced 
via the following PCR primers. 

Primers: 

1. 5'-Phosphate - GGGGGGTCTAGACCAACAACATCTTCAACGTCTACC -3' 
(SEQ ID NO 11) 

2. 5'-Phosphate - CC CCC CGA ATT CAC TAG TAC GCG TCT CTC TCC -3' 
(SEQ ID NO 12) 

There will be no difference in performance between a gac fragment generated via high 
fidelity PCR amplification and a gac fragment generated by chemical synthesis. 

The thus created gac fragment has the following nucleotide sequence (SEQ ID NO 13): 

5'-GGGGGGTCTAGACCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGG
CCGACCTGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCT
GGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCA
ACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCCATGCTGAGAGTTCTGCACCGGGCGG
CGTCCGCCTTGGTTATGGCGACTGTGATCGGCCTTGCGCCCGCGGAGAGAGACGCGTACT
AGTGAATTCGGGGGG-3' 

The gac fragment (either synthetic or created via PCR) and the vector plasmid pMG402 are 
ligated using the Xba I and EcoR I sites. This way the expression vector pMG412 is 
generated. 

The expression vector, pMG412, contains codons 1 - 23 + the first nucleotide of codon 24 of 
the gac signal sequence. Into codons 22-24 the Sac II site is introduced by silent mutation. 
Anything downstream of the Sac II site in pMG412 is primer or vector sequence. 

The last two nt of codon 24 + codons 25-27 are introduced by the forward PCR primer for 
the target structural gene (rhIFNα2B, see above). Such a primer therefore contains the 
following elements: 

Cutting overhang (e.g. 6 nucleotides) - Sac II site - tc gcc ttt gcg (SEQ ID NO 14) - 
hybridizing region corresponding to the 5' end of the "mature" target gene. 

In particular, a suitable primer has the following nucleotide sequence (SEQ ID NO 15): 

TT GCG CCC GCG GTC GCC TTT GCG - hybridizing region (Sac II site: CCGCGG) 

The last amino acids (24-27) of the gac signal sequence are V A F A (SEQ ID NO 16). 
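The primer layout described above can be illustrated by assembling the elements in order. This is a sketch under stated assumptions: the overhang, Sac II site and signal sequence tail are taken from SEQ ID NO 14 and 15, whereas the 18 nt hybridizing region is a hypothetical choice (the first six codons of the mature gene), since the source does not state its length.

```python
CUTTING_OVERHANG = "TTGCGC"         # first six nucleotides of SEQ ID NO 15
SAC_II_SITE = "CCGCGG"
SIGNAL_TAIL = "TCGCCTTTGCG"         # SEQ ID NO 14: rest of codon 24 plus codons 25-27
HYBRIDIZING = "TGCGATCTGCCGCAAACC"  # hypothetical length: codons 1-6 of the mature gene


def build_forward_primer(hybridizing: str) -> str:
    """Concatenate overhang, Sac II site, signal sequence tail and hybridizing region."""
    return CUTTING_OVERHANG + SAC_II_SITE + SIGNAL_TAIL + hybridizing


primer = build_forward_primer(HYBRIDIZING)

# The last G of the Sac II site is the first nucleotide of codon 24 (GTC, Val),
# so the reading frame starts at that position.
frame_start = primer.index(SAC_II_SITE) + len(SAC_II_SITE) - 1
in_frame = primer[frame_start:]
codons = [in_frame[i:i + 3] for i in range(0, len(in_frame) - len(in_frame) % 3, 3)]

print(primer)
print(codons)
```

Read in this frame, the first four codons (GTC GCC TTT GCG) correspond to V A F A and are followed directly by TGC, the first codon of the mature protein.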

From the plasmid construct described above the rhIFNα2b gene is amplified using a high 
fidelity PCR system. The 5' PCR primer contains the Sac II site for fusing the gene with the 
gac fragment plus the last four codons of the gac signal sequence. The 3' primer contains 
the TAA (ochre) stop codon and the Mlu I site for cloning. The amplification of the interferon 
alpha structural gene generates the following fragment (SEQ ID NO 17): 

GGGGGGCCGCGGTCGCCTTTGCGTGCGATCTGCCGCAAACCCACAGCCTGGGTAGCCGGC 
GAACCTTGATGCTTCTGGCACAGATGCGGCGAATCTCTCTTTTCTCTTGCTTAAAGGATC 
GACATGACTTCGGTTTCCCGCAGGAGGAGTTCGGTAACCAGTTCCAAAAGGCTGAAACCA 
TCCCGGTATTGCATGAGATGATCCAGCAGATCTTCAACCTGTTCAGCACTAAGGACTCTT 
CTGCTGCTTGGGATGAGACCCTGCTTGACAAATTCTACACTGAACTGTACCAGCAGCTGA 
ACGACCTGGAAGCCTGCGTGATCCAGGGTGTGGGTGTGACTGAGACTCCGCTGATGAAGG 
AGGACTCTATTCTGGCTGTGCGAAAATACTTCCAACGGATCACTCTGTATCTGAAAGAGA 
AGAAATACAGCCCGTGCGCCTGGGAGGTTGTCCGAGCAGAAATCATGCGGTCTTTCTCTT 
TGTCTACCAACTTGCAAGAATCTTTACGAAGCAAGGAATAA TACGCGT GAATTCGGGGGG 

This rhIFNα2b PCR fragment and pMG412 are ligated using the Sac II and Mlu I sites. This 
way the final production/expression plasmid pMG414 is generated. Both strands of 
pMG414 are sequenced and show no differences to the expected sequence. 

Features of plasmid pMG414 (total size 3668 bps): 

bps 2728-256: pUC19 backbone, part 1 
bps 257-546: gac fragment (promoter, RBS, signal sequence) 
bps 547-1044: synthetic rhIFNα2b gene (including TAA stop) 
bps 1045-1454: cloning sites + pUC19 backbone, part 2 
bps 1455-2727: tet gene from pBR322 (promoter/RBS 1455-1536, 
coding sequence including TAA stop 1537-2727) 

Thereby, the gac fragment containing promoter, RBS and signal sequence is fused to the 
rhIFNα2b structural gene using a restriction endonuclease site at the 3' end of the gac 
fragment introduced by a PCR primer. The same site is fused to the 5' end of the rhIFNα2b 
structural gene, also by way of a PCR primer. So after cloning both elements (gac 
fragment and rhIFNα2b structural gene) into the basic vector a gene encoding a 
gacss-rhIFNα2b fusion protein is generated. Of its total 192 codons (576 nucleotides) the first 
27 encode the gac signal sequence not present in the final protein and codons 28 to 192 
encode mature rhIFNα2b (165 amino acids, cysteine 1 to glutamic acid 165). 
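The feature coordinates and codon counts given above can be cross-checked arithmetically. The sketch below treats the plasmid as circular with 1-based inclusive coordinates (an assumption about the numbering convention); the signal sequence coordinates (bps 466 to 546) are taken from the region list given further below in this document.

```python
PLASMID_SIZE = 3668

# (feature, first bp, last bp); the first feature wraps around the origin.
FEATURES = [
    ("pUC19 backbone, part 1", 2728, 256),
    ("gac fragment", 257, 546),
    ("synthetic rhIFNa2b gene incl. TAA", 547, 1044),
    ("cloning sites + pUC19 backbone, part 2", 1045, 1454),
    ("tet gene from pBR322", 1455, 2727),
]


def span_length(start: int, end: int, size: int) -> int:
    """Length of an inclusive span on a circular molecule of the given size."""
    if end >= start:
        return end - start + 1
    return size - start + 1 + end


lengths = {name: span_length(s, e, PLASMID_SIZE) for name, s, e in FEATURES}
total = sum(lengths.values())

# Each feature must end exactly one bp before the next one starts.
contiguous = all(
    FEATURES[i][2] % PLASMID_SIZE + 1 == FEATURES[(i + 1) % len(FEATURES)][1]
    for i in range(len(FEATURES))
)

signal_nt = span_length(466, 546, PLASMID_SIZE)        # gac signal sequence
mature_nt = lengths["synthetic rhIFNa2b gene incl. TAA"] - 3   # without stop codon
fusion_codons = (signal_nt + mature_nt) // 3

print(lengths, total, contiguous, fusion_codons)
```

Under these assumptions the five features tile the plasmid without gaps or overlaps, and the fusion gene comprises 27 + 165 = 192 codons.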

The nucleotide sequence of the expression cassette used in the rhIFNα2b expression 
plasmid pMG414 (807 bps) (see below) and amino acid sequence of the gacss-rhIFNα2b 
fusion protein is shown as follows (SEQ ID NO 18): 

5'-|TCTAGA|CCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGGCCGACC
TGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCTGGTTCG
TACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCAACGGCG
TCTCCGGATCTGGGTGAGAGGGGAAATCC

ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG
 M   L   R   V   L   H   R   A   A   S   A   L   V   M   A
ACT GTG ATC GGC CTT GCG C|CC GCG G|TC GCC TTT GCG
 T   V   I   G   L   A   P    A   V    A   F   A

TGC GAT CTG CCG CAA ACC CAC AGC CTG GGT AGC CGG CGA ACC TTG
 C   D   L   P   Q   T   H   S   L   G   S   R   R   T   L
ATG CTT CTG GCA CAG ATG CGG CGA ATC TCT CTT TTC TCT TGC TTA
 M   L   L   A   Q   M   R   R   I   S   L   F   S   C   L
AAG GAT CGA CAT GAC TTC GGT TTC CCG CAG GAG GAG TTC GGT AAC
 K   D   R   H   D   F   G   F   P   Q   E   E   F   G   N
CAG TTC CAA AAG GCT GAA ACC ATC CCG GTA TTG CAT GAG ATG ATC
 Q   F   Q   K   A   E   T   I   P   V   L   H   E   M   I
CAG CAG ATC TTC AAC CTG TTC AGC ACT AAG GAC TCT TCT GCT GCT
 Q   Q   I   F   N   L   F   S   T   K   D   S   S   A   A
TGG GAT GAG ACC CTG CTT GAC AAA TTC TAC ACT GAA CTG TAC CAG
 W   D   E   T   L   L   D   K   F   Y   T   E   L   Y   Q
CAG CTG AAC GAC CTG GAA GCC TGC GTG ATC CAG GGT GTG GGT GTG
 Q   L   N   D   L   E   A   C   V   I   Q   G   V   G   V
ACT GAG ACT CCG CTG ATG AAG GAG GAC TCT ATT CTG GCT GTG CGA
 T   E   T   P   L   M   K   E   D   S   I   L   A   V   R
AAA TAC TTC CAA CGG ATC ACT CTG TAT CTG AAA GAG AAG AAA TAC
 K   Y   F   Q   R   I   T   L   Y   L   K   E   K   K   Y
AGC CCG TGC GCC TGG GAG GTT GTC CGA GCA GAA ATC ATG CGG TCT
 S   P   C   A   W   E   V   V   R   A   E   I   M   R   S
TTC TCT TTG TCT ACC AAC TTG CAA GAA TCT TTA CGA AGC AAG GAA
 F   S   L   S   T   N   L   Q   E   S   L   R   S   K   E
TAA

T |ACGCGT| ACTAGT |GAATTC|-3'



The sequence as shown is divided into sub-paragraphs / regions which comprise: 

1. the gac promoter and RBS (first paragraph, bps 257 to 465 of pMG414, see below), 

2. the gac signal sequence coding region (second paragraph, bps 466 to 546 of pMG414, 
see below), 

3. the synthetic gene for rhIFNα2b (third paragraph, bps 547 to 1044 of pMG414 (see 
below) - including the TAA stop codon), and 

4. the 3' cloning linker (fourth paragraph, bps 1045 to 1063 of pMG414, see below). 

On pMG414 these four regions are directly joined to one another. They are separated in 
the figure for reasons of clarity only. 

The start (ATG) and the stop (TAA) codons of the open reading frame are shown in bold. 
The first (TGC) and the last (GAA) codon of mature rhIFNα2b are underlined. 



The restriction endonuclease sites used for cloning are boxed. These are: 

- Xba I (TCTAGA) and EcoR I (GAATTC) for the introduction of the gac fragment 
(promoter, RBS, signal sequence, Sac II, Mlu I, Spe I sites) 

- Sac II (CCGCGG) and Mlu I (ACGCGT) for the introduction of the rhIFNα2b PCR 
fragment (including four codons for the last four amino acids of the gacss, the 495 bp 
synthetic gene for mature rhIFNα2b, and the TAA(T) stop codon). 

The gac promoter shows high constitutive / basal activity; the addition of a chemical inducer 
or a physical stimulus (change in culture conditions) is not required. 

1.3 Cloning and establishment of the recombinant cell line 

The expression plasmid pMG414 is introduced into the host strain ATCC PTA-3132 
(= W3110 (ATCC 27325)) by electroporation. Electrocompetent cells are prepared according 
to a standard protocol; electroporation is carried out in 0.1 mm cuvettes at 1800 V using an 
Eppendorf Electroporator 2510. 

After electroporation the reaction is suspended in liquid medium and plated onto agar plates 
containing tetracycline. 

The starting point for selection of a suitable cell clone is a thus obtained transformation plate. 
Various clones from this plate are grown in liquid culture and cryopreserved as research cell 
banks. Their productivity is tested in shake flask experiments and compared. The best clone 
(E1/116) is used for further development. 

The best clone may show good productivity but relatively poor growth. This poor growth can 
result from various factors, e.g. product toxicity to the host cell, metabolic burden due to 
product synthesis etc. The addition of glucose often brings some improvement because 
glucose downregulates (e.g. by catabolite repression) many promoters used for recombinant 
protein expression. Also, glucose has a general positive effect on the growth of E. coli 
because it can be directly introduced into the metabolism as a carbon source. 

In the case of E1/116, a clear positive effect of glucose on growth is observed. The best 
results are achieved with glucose concentrations between 2 and 5 g/L. To adapt the cell line 
to cope with product formation and consequently to better growth in the absence of glucose, 
the strain is therefore grown in liquid medium in shake flasks for several passages ("shake 
flask cascade"). 

More specifically, a cryovial of E1/116 is thawed and the cell suspension streaked onto 
glucose free LB agar plates containing tetracycline. The plate is incubated at 37°C until the 
colonies reach a sufficient size for inoculating a liquid culture. Colonies are transferred from 
the plate into small shake flasks filled with 15 mL of glucose free LB broth containing 
tetracycline. The cultures are shaken at 37°C until they reach an optical density at 600 nm of 
above 0.5 (typically > 1.0). For this first round this takes up to 48 hours due to the poor 
growth characteristics of the original isolate. 

The procedure described in the above paragraph is performed five consecutive times with 
the liquid culture of the previous round being streaked onto plates and the colonies from the 
plates serving to inoculate the next liquid culture. From each liquid culture the optical density 
is determined and a sample is taken for determination of product titer using SDS-PAGE - 
Western Blot. Clones from the culture with the best combination of growth and productivity 
are then used to initiate the next round. 

In the course of the different rounds of this culture cascade (i.e. multiple propagation and 
reisolation steps) the growth characteristics of the (sub)strain(s) gradually improve. By 
choosing the strain with the best combination of growth and productivity in each round, titers 
are also gradually increased. After the fifth round again single colonies are generated on LB 
agar plates containing tetracycline and used to inoculate a liquid culture containing 
tetracycline for the generation of a primary seed lot (PSL). The culture is grown at 37°C to 
an optical density of about 1.5, mixed with an equal amount of sterile 40% w/v glycerol, 
aliquoted into cryogenic vials and frozen at -80°C. This PSL is used as a starting point for 
the generation of the GMP cell banks (Master Cell Bank and Working Cell Bank) of the 
interferon alpha 2b production strain. This reisolate is designated E1/116a. Reisolation 
processes like the one described above have proved to yield reproducible results. 

E1/116a shows excellent growth characteristics in shake flasks and stirred bioreactors 
(fermenters). An inoculum suitable for starting a bioreactor can be grown in a shake flask 
starting from a Working Cell Bank vial in about 8 hours. 

A Master Cell Bank is prepared under cGMP conditions from the primary seed lot described 
above. In brief, a PSL vial is thawed and plated onto tetracycline-containing agar plates. A 
single colony is picked and used to inoculate the Master Cell Bank (MCB) shake flask culture 
(LB broth medium containing tetracycline). The cell suspension from the logarithmic growth 
phase is mixed 1+1 with 40 % w/v glycerol, aliquoted at 1.8 mL into cryogenic vials, sealed 
in cryogenic tubing, and frozen in the liquid phase of a liquid nitrogen tank. 

The Working Cell Bank is generated in the same way as the Master Cell Bank except that 
the shake flask culture is inoculated with cell suspension from a thawed MCB vial. 

Example 2: Fermentation process for production of recombinant human interferon alpha 2B 
(rhIFNα2B) 

The fermentation process is started by growing the strain E. coli K-12 W3110 obtainable from 
the Working Cell Bank as described above in shake flask cultures in Luria Bertani (LB) 
medium at 37°C with the addition of the antibiotic tetracycline hydrochloride to avoid growth of 
non-plasmid carrying cells. 

The shake flask culture is then used to inoculate the seed culture (= pre-culture) medium 
(inoculum size = 0.4%). The medium for this pre-culture cultivation is based on deionized 
water containing glucose as a sole carbon source and yeast autolysate as a complex 
nitrogen source. In addition, inorganic salts like KH2PO4, K2HPO4, (NH4)2SO4 and 
MgSO4·7H2O are added to the medium. As an antifoam agent polypropylene glycol 2000 
(PPG2000) is used. In particular, the pre-culture medium has the following composition: 

Pre-culture Basal Medium: 

Component                                       Amount
De-ionized water (WBI)                          30 l
Yeast autolysate, KAT, Ohly                     21.7 g/l
Glucose Monohydrate, pure                       25.0 g/l
Ammonium Sulfate, p.a.                          1.0 g/l
Potassium Phosphate Monobasic, p.a.             1.5 g/l
Potassium Phosphate Dibasic, anhydrous, pure    3.0 g/l
Magnesium Sulfate Heptahydrate, p.a.            0.5 g/l
Polypropylene Glycol 2000                       0.5 ml/l



These media components are sterilized together for 20 minutes at 121 °C. After cooling of the 
basal medium, an aliquot of a 5 g/L sterile stock-solution of the antibiotic Tetracycline 
Hydrochloride is added to the basal medium (sterilization is performed by filtration (0.22 µm 
filter)). 



Stock-solution Tetracycline Hydrochloride (5 g/L): 

Component                                      Amount
Tetracycline Hydrochloride cryst., Ph. Eur.    15 mg/l
De-ionized water (WBI) 
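The volume of stock solution to be added follows from a simple dilution calculation. The sketch below assumes that the 15 mg/l stated in the table is the target concentration in the finished medium and that the batch volume is the 30 l of the pre-culture basal medium; the small volume increase caused by the addition is neglected. The function name is illustrative.

```python
def stock_volume_ml(final_mg_per_l: float, medium_volume_l: float,
                    stock_g_per_l: float) -> float:
    """Volume of stock solution (mL) needed to reach the target concentration.

    Uses C1 * V1 = C2 * V2 and ignores the volume added by the stock itself.
    A stock of x g/L contains x mg per mL.
    """
    required_mass_mg = final_mg_per_l * medium_volume_l
    return required_mass_mg / stock_g_per_l


# Assumed values: 15 mg/l final concentration, 30 l medium, 5 g/L stock.
volume = stock_volume_ml(final_mg_per_l=15, medium_volume_l=30, stock_g_per_l=5)
print(f"{volume:.0f} mL of 5 g/L tetracycline stock for 30 l of medium")
```

Under these assumptions about 90 mL of stock solution are required, i.e. 3 mL per litre of medium.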





The cultivation time for the seed culture is about 16 hours. During cultivation of the seed 
culture the pH value is controlled to a set-point of 7.0 ± 0.2 with sulfuric acid and sodium 
hydroxide or concentrated ammonia solution. The concentration of dissolved oxygen is kept at 
levels higher than 20% of saturation by increasing the stirrer speed. The stirrer speed at the 
beginning of the cultivation is set to 300 rpm, the back-pressure in the vessel to 0.3 bar and 
the aeration rate is controlled to 30 L/min (equivalent to "1 vvm"). The temperature is kept 
constant at 37°C during cultivation. As a transfer criterion of broth to the main stage of the 
fermentation process, an increase of the dissolved oxygen concentration after consumption of 
the carbon source is used. 

For main culture cultivation a medium based on deionized water, glucose as a carbon source 
and yeast autolysate as a complex nitrogen source is used. Besides the addition of the 
inorganic salts (NH4)2SO4, CaCl2·2H2O and MgSO4·7H2O, PPG 2000 is used as an antifoam 
agent. The initial glucose is sterilized separately and added to the sterile rest of the medium. 
The inoculum size for the main fermenter medium is in a range between 0.75 and 3%. In 
particular, the main culture medium has the following composition: 



Main Culture Basal Medium: 



WO 2005/017162 

PCT/EP2004/009067 



Component                                 Amount
De-ionized water (WBI)                    60 l
Yeast autolysate, KAT, Ohly               43.5 g/l
Ammonium Sulfate, p.a.                    1.0 g/l
Calcium Chloride Dihydrate cryst., p.a.   0.3 g/l
Magnesium Sulfate Heptahydrate, p.a.      1.0 g/l
Polypropylene Glycol 2000                 0.5 ml/l



These media components are sterilized together for 20 minutes at 121 °C. After 
cooling, an aliquot of an 800 g/L separately heat-sterilized glucose stock-solution is added to 
the main culture basal medium (sterilization is performed for more than 30 minutes at 120 °C). 



Glucose Stock-solution, 800 g/l: 

Component                 Amount
Glucose Syrup             12.5 ml/l
De-ionized water (WBI) 





The most important point during this cultivation is the necessity of a complete consumption 
of the initial glucose present in the medium. This leads to a sharp increase of dissolved 
oxygen concentration after about 9 hours of growth. If glucose feeding is started before total 
consumption of the initial glucose, no product formation is observed. Glucose limitation 
controlled by the feeding of the glucose solution at a constant rate is therefore very 
important. The temperature during cultivation is controlled to a constant value of about 28°C. 
The initial stirrer speed is set to 300 rpm, the aeration rate is controlled to 100 L/min 
(equivalent to "1 vvm") and the back-pressure in the vessel is set to 0.3 bar. The pH value is 
controlled to 7.1 ± 0.3 with sulfuric acid and sodium hydroxide or concentrated ammonia 
solution. A peak of the pH value up to 8.0 after consumption of the initially supplied glucose 
is acceptable. 
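The sharp rise in dissolved oxygen that marks exhaustion of the initial glucose could, for example, be detected automatically from logged DO values before feeding is started. The following is only an illustrative sketch: the detection rule, the threshold of 20 percentage points, the window of three samples and the example data are hypothetical and are not taken from the process described here.

```python
from typing import Optional, Sequence


def find_do_spike(times_h: Sequence[float], do_percent: Sequence[float],
                  min_rise: float = 20.0, window: int = 3) -> Optional[float]:
    """Return the first time at which DO exceeds the value `window` samples
    earlier by at least `min_rise` percentage points, or None if no such
    rise occurs. Thresholds are hypothetical and would need tuning."""
    for i in range(window, len(do_percent)):
        if do_percent[i] - do_percent[i - window] >= min_rise:
            return times_h[i]
    return None


# Hypothetical DO trace (% of saturation) sampled every 15 minutes.
times = [8.0, 8.25, 8.5, 8.75, 9.0, 9.25, 9.5, 9.75]
do_values = [32.0, 31.0, 30.0, 30.0, 31.0, 45.0, 70.0, 85.0]

feed_start = find_do_spike(times, do_values)
print("start glucose feed at t =", feed_start, "h")
```

With the example trace the rule triggers at 9.5 h, i.e. only after the oxygen demand has dropped, which reflects the requirement that feeding must not begin before the initial glucose is fully consumed.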

The concentration of the dissolved oxygen is controlled to values higher than 20% of 
saturation. Depending on the oxygen transfer capacity of the bioreactor, the DO concentration 
is kept at levels higher than 20% of saturation, preferably between about 40 % and 100% of 
saturation, by first increasing the stirrer speed to a maximum value. If this is not sufficient, 
first the aeration rate and after that the back-pressure is increased. After a cultivation 
time between 48 and 192 hours (a linear increase of product formation is observed with 
cultivation time) the culture is harvested, cooled to 15 ± 5 °C and conditioned for 
downstream processing by the addition of sucrose/EDTA to the cooled broth. 

The results of a fermentation batch are analysed based on the Western blot technique or on 
HPLC measurements after laboratory or pilot plant periplasmic extraction of the product. 

Example 3: Cell disruption and extraction 

A fermentation broth obtained as described above containing host cells comprising the 
expressed interferon alpha 2B in the periplasmic space is adjusted with sulfuric acid to a pH 
of 5.0 ± 0.1 immediately after the fermentation and cooled down to 4°C ± 2°C. The low pH and 
the low temperature help to inactivate endogenous proteases. 

The fermentation broth is adjusted to 10°C to 20°C, then without any concentration or 
washing of the cells, solid or liquid sucrose (200 g sucrose/kg fermentation broth) and EDTA 
(concentration 10 mM) are added and the pH adjusted to 8.0. After a selective one-step cell 
permeation protocol using osmotic shock (1+3 dilution) with cooled water, whereby the 
fermentation broth comprising sucrose and EDTA is poured or pumped into the cooled 
(temperature about 4°C) water, the released periplasmic extract is clarified. 
Polyethyleneimine is added to a final concentration of 0.05 % and the pH is adjusted to 
about 7.5 with acetic acid. After 15 to 45 minutes cell debris and DNA flocculate, leaving a 
clear crude extract containing interferon which may be subjected to centrifugation to improve 
clarity. 

This procedure leads to a clear periplasmic extract comprising the desired interferon alpha 
2B in high yield with a purity of > 20 % with respect to the total protein content. 
Polyethyleneimine helps to separate the cell debris from the soluble protein extract leading 
to a very pure interferon solution. 

Example 4: Chromatographic purification of recombinant human interferon α 2B (rhIFNα2B) 

4.1 Capture by Cation Exchange Chromatography (CEX) 

After pH adjustment to 4.8 - 5.2 with acetic acid and a filtration step using a 0.3 micron filter, 
the crude extract of Example 3 is applied to the CEX column (S ceramic HyperD F 
(Biosepra)). After a washing step with an equilibration buffer (20 mM sodium acetate and 
70 mM NaCl at pH 5.0) the interferon is eluted with a step gradient at 175 mM NaCl. The 
fraction collected is immediately processed by the process step of Example 4.2. 

4.2 Anion Exchange (AEX) Chromatography 

The fraction from Example 4.1 is adjusted to a pH of 7.3 to 7.7 with sodium hydroxide, 
diluted with purified water to a conductivity of 3.5 to 4.5 mS/cm and applied to the AEX 
column (Q ceramic HyperD F (Biosepra)). After washing, the interferon is eluted with a linear 
salt gradient (0 - 300 mM NaCl) at about 150 mM NaCl. Fractions are collected that have a 
purity of greater than or equal to 90 area % according to IPC reversed-phase HPLC and 
used directly in the next step (see Example 4.3). 

4.3 Hydrophobic Interaction Chromatography (HIC) 

The fraction of Example 4.2 is diluted (1:1) with a stock solution of sodium sulphate (0.5% 
sodium sulphate), adjusted to pH 7.3 to 7.7 with NaOH or HCl and applied to the HIC column 
(Source 15PHE (Pharmacia)). After washing, the interferon fraction of Example 3 is eluted 
with a linear sodium sulphate gradient (800 - 0 mM sodium sulphate) at about 400 mM 
sodium sulphate. The fractions collected that have a purity of greater than or equal to 93 
area % and no impurity greater than or equal to 3% according to IPC reversed-phase HPLC 
are used directly in the next purification step. 

4.4 Cation Exchange Chromatography (CEX) 

The collected fractions of Example 4 are diluted with water to a final conductivity of 7.5 to 8.5 
mS/cm, adjusted to pH 4.3 to 4.7 with 99 to 100% acetic acid and applied to the CEX column 
(Toyopearl SP-650 S (TosoHaas)). After a washing step, the interferon is eluted in a linear 
NaCl gradient (0 - 300 mM NaCl) at about 250 mM NaCl. The fractions are collected that 
have a purity of greater than or equal to 95 area % and no impurity greater than or equal to 
3% according to IPC reversed-phase HPLC and are used directly in the next purification 
step. 

4.5 Size Exclusion Chromatography 

The last purification step is a gel filtration step to remove dimers and other aggregates and 
to perform a buffer exchange for the final formulation. The Superdex 75 pg used in this step 
shows a good resolution even at a high load volume (5 % - 15 %). The SEC is performed in 
25 mM sodium phosphate, 130 mM NaCl and 0.5 mM EDTA at a pH of about 7.3 to 7.7. 

The fractions with the highest purity (> 95 % main peak in RP-HPLC and no side peak > 3 
%) are pooled to give the final bulk solution comprising the desired recombinant human 
interferon α 2B in pure form in a high yield. 
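The in-process purity criteria stated in Examples 4.2 to 4.4 lend themselves to a simple acceptance check per fraction. The sketch below encodes only those three steps, whose limits are given as "greater than or equal to"; the data structure, step labels and example fractions are hypothetical. The final SEC pooling rule uses strict inequalities and is not covered.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Fraction:
    name: str
    main_peak_area_pct: float     # purity by reversed-phase HPLC, area %
    largest_impurity_pct: float   # largest single impurity, area %


# step: (minimum purity, impurity limit that must not be reached; None = not stated)
CRITERIA = {
    "AEX (4.2)": (90.0, None),
    "HIC (4.3)": (93.0, 3.0),
    "CEX (4.4)": (95.0, 3.0),
}


def passes(fraction: Fraction, step: str) -> bool:
    """True if the fraction meets the purity criteria stated for the step."""
    min_purity, impurity_limit = CRITERIA[step]
    if fraction.main_peak_area_pct < min_purity:
        return False
    limit: Optional[float] = impurity_limit
    if limit is not None and fraction.largest_impurity_pct >= limit:
        return False
    return True


# Hypothetical fractions for illustration.
fractions = [Fraction("F1", 94.0, 2.0), Fraction("F2", 94.0, 3.0), Fraction("F3", 96.5, 1.2)]
pooled = [f.name for f in fractions if passes(f, "HIC (4.3)")]
print(pooled)
```

In this example F2 is rejected at the HIC step because its largest impurity reaches the 3% limit, although its main peak purity would be sufficient.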



